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In this paper, we introduce the new concept of state complexity approximation, which is a further 
development of state complexity estimation. We show that this new concept is useful in both of the 
following two cases: the exact state complexities are not known and the state complexities have been 
obtained but are in incomprehensible form. 

1 Introduction 

The state complexity of combined operations has been studied in, e. g., |[T2l l4ll3l. It has been shown that 
the state complexity of combined operations is at least as important and practical as the state complexity 
of individual operations. There is only a limited number of individual operations on regular languages. 
However, the number of combined operations on regular languages is unlimited and each of them is not 
simply a mathematical composition of the state complexities of their component individual operations. 
It appears that the exact state complexity of each combined operation has to be studied specifically. 

There are at least the following two problems concerning the state complexities for combined op- 
erations. First, the state complexities of many combined operations are extremely difficult to compute. 
Second, a large proportion of results that have been obtained are pretty complex and impossible to com- 
prehend. For example, the state complexity of the catenation for four regular languages accepted by 
m,n,p,q states, respectively, is 



It is clear that close estimations of state complexities are good enough in many automata applications. 
In lfl3l l3l. estimations of state complexity of combined operations have been proposed and studied. In 
this paper, we go further in the direction of the study in lfT3l and introduce the concept of state 
complexity approximation. Briefly speaking, an approximation of a state complexity is an estimate of 
the state complexity with a ratio bound clearly defined. The ratio bound gives a precise measurement on 
the quality of the estimate. 

The idea of state complexity approximation is from the notion of approximation algorithms which 
was formalized in early 1970's by David S. Johnson et al. |5J|9j[T0]]. Many polynomial-time approxima- 
tion algorithms have been designed for a quite large number of NP-complete problems, which include 
the well-known travelling-salesman problem, the set-covering problem, and the subset-sum problem. 
Obtaining an optimal solution for an NP-complete problem is considered intractable. Near optimal solu- 
tions are often good enough in practice. Assuming that the problem is a maximization or a minimization 
problem, an approximation algorithm is said to have a ratio bound of p(n) if for any input of size n, the 
cost C of the solution produced by the algorithm is within a factor of p(n) of the cost C* of an optimal 
solution HI: 
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The concept of state complexity approximation is in many ways similar to that of approximation 
algorithms. A state complexity approximation is close to the exact state complexity and normally not 
equal to it. The ratio bound shows the error range of the approximation. In addition to the property of 
having a small ratio bound in general, we also consider that a state complexity approximation should be 
in a simple and intuitive form. 

In spite of the similarities, there are fundamental differences between a state complexity approxima- 
tion and an approximation algorithm. The efforts in the area of approximation algorithms are in finding 
polynomial algorithms for NP-complete problems such that the results of the algorithms approximate the 
optimal results. In comparison, the efforts in the state complexity approximation are in searching directly 
for the estimations of state complexities such that they satisfy certain ratio bounds. The aim of designing 
an approximation algorithm is to transform an intractable problem into one that is easier to compute and 
the result is acceptable although not optimal. In comparison, a state complexity approximation result 
may have two different effects: (1) it gives a reasonable estimation of certain state complexity, with 
some bound, the exact value of which is difficult or impossible to compute; or (2) it gives a simpler and 
more comprehensible formula that approximates a known state complexity. 

In the next section, we give some basic definitions and notation including the formal definition of 
state complexity approximation. In Section 3, we show the state complexity approximation results on 
four basic combined operations: the star of union, the star of intersection, the star of catenation, and 
the star of reversal. In Section 4, we show that state complexity approximation results can be easily 
obtained for some operations the exact state complexities of which may be very difficult to obtain. In 
Section 5, we show that certain state complexity can be very complex in formulation. A state complexity 
approximation is clearly more intuitive and comprehensible than the exact state complexity. In Section 6, 
we conclude the paper. 

2 Preliminaries 

A deterministic finite automaton (DFA) is denoted by a 5-tuple A = (Q,L,5,s,F), where Q is the finite 
and nonempty set of states, £ is the finite and nonempty set of input symbols, 5 : Q x £ — » Q is the state 
transition function, s € Q is the initial state, and F C Q is the set of final states. A DFA is said to be 
complete if S is a total function. Complete DFAs are the basic model for considering state complexity. 
Without specific mentioning, all DFAs are assumed to be complete in this paper. 

A nondeterministic finite automaton (NFA) is also denoted by a 5-tuple M = (Q,l.,S,s,F), where 
Q, £, s, and F are defined the same way as in a DFA and 5 : Q x £ — > 2^ maps a pair consisting of a 
state and an input symbol into a set of states rather than, more restrictively, a single state. An NFA may 
have multiple initial states, in which case an NFA is denoted (Q,L,5,S,F) where S is the set of initial 
states. 

The reader may refer to HHTTJEl ^ or a ratner complete background knowledge in automata theory. 

State complexity (CHI) is a descriptional complexity measure for regular languages based on the de- 
terministic finite automaton model. So, by state complexity we mean the deterministic state complexity. 

The state complexity of a regular language L, denoted sc(L), is the number of states in the minimal 
complete DFA accepting L. When we speak about the state complexity of a (combined) operation on 
regular languages, we mean the worst case state complexity of the languages resulting from the operation 
as a function of the state complexity of the regular operand languages. So, without specific mentioning, 
by state complexity we mean the worst-case state complexity in the following. 

If the above definition is based on minimal NFA rather than minimal complete DFA, we have the 
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nondeterministic state complexity, which has been studied in (6J, |7] . 

Let £ be a combined operation on k regular languages. Assume that the state complexity of £ is 9. 
We say that q is a state complexity approximation of the operation £ with the ratio bound p if, for any 
large enough positive integers ri\,... ,n k , which are the numbers of states of the DFAs that accept the 
argument languages of the operation, respectively, 

/ a(m,...,n fc ) 6(n u ...,n k )\ 

max 757 7w \ < P{m,...,n k ). 

\9(m,...,n k ) a{m,...,n k )J 

Note that in many cases, p is a constant. Since state complexity is a worst-case complexity, an ap- 
proximation that is not smaller than the actual state complexity is preferred, which is the case for every 
approximation result in this paper. 



3 Some basic results on state complexity approximation 

In lfT3ll . an estimation method through nondeterministic state complexities was introduced for the (de- 
terministic) state complexities of certain types of combined operations. The method is described in the 
following. 

Assume we are considering the combination of a language operation g\ with k arguments together 
with operations g\, i = 1, . . . , k. The nondeterministic estimation upper bound, or NEU-bound for the 
deterministic state complexity of the combined operation g\(g\,.. . ,g%) is calculated as follows: 

(i) Let the arguments of the operation g\ be DFAs Aj with m* states, % = 1, . . . ,k, j = 1, . . . ,rj, rj > 1. 

(ii) The nondeterministic state complexity of the combined operation is at most the composition of the 
individual state complexities, and hence the language 

5l (^(L(^!),...,L(^J),..., 5 f(L(4),...,L(<))) 

has an NFA with at most 

nsc(0i)(nsc(s])(m}, . . . ,m l ri ), . . . ,nsc(flf )(mf , ■■■,m k Tk )) 

states, where nsc(g) is the nondeterministic state complexity (as a function) of the language oper- 
ation g. 

(iii) Consequently, the deterministic state complexity of the combined operation gi [g\ , . . . , g\ ) is upper 
bounded by 

2nsc(g 1 )(nsc(^)(mj,...,m|. I ),...,nsc(3* : )(m{ : ,...,mJ: fc )) ^ 

The nondeterministic state complexity of the basic individual operations on regular languages has 
been investigated in EUTHH. 

In the following we show that this estimation method can produce nice approximation results for the 
state complexities of certain combined operations. The table below shows the actual state complexities 
and their corresponding NEU-bounds of the four combined operations |[T3l : (1) star of union, (2) star of 
intersection, (3) star of catenation, and (4) star of reversal. 



Operations 


State Complexity 


NEU-bound 


(L(A)UL(B)r 




/ 2jn-\-n-\-2 


(L(A) n L(B))* 


3/4 2 mn 




(L(A)L(B))* 


2 m+n_1 _|_2 m +™- 4 _2 m_1 -2 n_1 + m+ 1 




(L(B) R r 


2" 


2 n+2 
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The next table shows clearly that each NEU-bound in the previous table gives a very good approxi- 
mation to its corresponding state complexity. 



Operations 


Ratio bounds of the approximation 


(L(A)UL(B)y 


w8 


(L(A) L(B))* 


8/3 


(L(A)L(B))* 


4 


(MB) R r 


4 



In the above cases, although the exact state complexities have been obtained, the approximation re- 
sults with small ratio bounds are good enough for practical purposes, and they clearly have the advantage 
of being more intuitive and simpler in formulation. 

4 Approximation without knowing actual state complexity 

In this section, we consider two combined operations: (1) star of left quotient and (2) left quotient of 
star. For each of the combined operations, we do not have the exact state complexity; however, an 
approximation with a good ratio bound is obtained. 

Let R and L be two languages over the alphabet L. Then the left quotient of R by L, denoted L\R, 
is the language 

{y \xy £ R and x G L}. 
In the following, we assume that all languages are over an alphabet of at least two letters. 

4.1 The state complexity approximation of star of left quotient 

Theorem 1. Let Rbe a language accepted by an n-state DFA M, n > 0, and L be an arbitrary language. 
Then there exists a DFA of at most 2 n states that accepts (L\R)*. 

Proof: Let M = (Q,L,5, s,F) be a complete DFA of n states and R = L(M). For each q € Q, denote 
by L(M q ) the set {w £ L*\5(s,w) = q}. We construct an NFA M' with multiple initial states to accept 
(L\R) + as follows. M' is the same as M except that the initial state s of M is replaced by the set of 
initial states S = {q\L(M q ) n L / 0} and e-transitions are added from each final state to the states in S. 
By using subset construction, we can construct a DFA A' of no more than 2 n — 1 states that is equivalent 
to M'. Note that is not a state of A'. From the DFA A', we construct a new DFA A by just adding a 
new initial state that is also a final state and the transitions from this new state that are the same as the 
transitions from the original initial state of A'. It is easy to see that L(A) = (L\R)* and A has 2 n states. 

□ 

This result gives an upper bound for the state complexity of the combined operation: star of left 
quotient. 

Theorem 2. For any integer n>2, there exist a DFA M of n states and a language L such that any 
DFA accepting (L\L(M))* needs at least 2 n ~ x +2 n ~ 2 states. 

Proof: For n = 2, it is clear that R = {w E {a, b}* \# a (w) is odd} is accepted by a two-state DFA, and 



({s}\R)*=R* = {e}U{w G {a,b}*\# a (w) > 1} 
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cannot be accepted by a DFA with less than three states. 

For n > 2, let M = (Q,{a,b},S,0, {n — 1}) where Q = {0, 1, . . . ,n — 1}, 5(i,a) = i + 1 mod n, 
i = 0, 1, ... ,n— 1, 5(0, a) = and <5(j,6) = j + 1 mod n, j = 1, ... ,n — 1. 

It has been proved in [15] that the minimal DFA accepting L(M)* has 2 n_1 +2 n ~ 2 states. Let 
L = {e}. Then (L\L(M))* = L(M)*. So, any DFA accepting (L\L(M))* needs at least 2"" 1 +2 n ~ 2 
states. □ 

This result gives a lower bound for the state complexity of star of left quotient. Clearly, the lower 
bound does not coincide with the upper bound. We still do not know the exact state complexity for this 
combined operation, yet, which could be difficult to obtain. However, we can easily obtain a good state 
complexity approximation for the operation. Let 2 n the approximation. Then the ratio bound would be 

2 n _ 4 

In- 1 _i_ On— 2 a 



4.2 The state complexity approximation of left quotient of star 

Here we consider the combined operation: left quotient of star. 

Theorem 3. Let R be a language accepted by an n-state DFA M and L an arbitrary language. Then 
there exists a DFA of at most 2 n+1 — 1 states that accepts L\R*. 

Proof: Let M=(Q,E, 5, s, F) be a complete DFA of n states and R=L(M). Then we can easily construct 
an (n + l)-state NFA M' = (QU {s'},L,5' ,s f ,FU {s'}) such that L(M') = R* by adding anew initial 
state s' and transitions 5'(s',e) = s and 5'(f,e) = s' for each final state / G F. For each q G Q U {«'}, 
we denote by L(M q ) the set {w E £ 6'(s',w)}. We construct an NFA N with multiple initial states 
to accept L\L{M') = L\R* as follows. N is the same as M' except that the initial state s' of M' is 
replaced by the set of initial states S = {q\ L(M p )nL ^ 0}. By using subset construction, we can verify 
that there exists a DFA A of no more than 2 n+l — 1 states that is equivalent to N. Note that is not a 
state of A. It is easy to see that 

L(A) = L(N) = L\L(M') = L\R*. 
So, 2" +1 — 1 is an upper bound of the state complexity of left quotient of star. □ 



Theorem 4. For any integer n > 2, there exist a DFA M of n states and a language L such that any 
DFA accepting L\L(M)* needs at least 2™" 1 + 2 n ~ 2 states. 

Proof: For n = 2, we still use R = {w € {a,6}*|# a (u>) is odd} which is accepted by a two-state DFA. 
{e}\ii* = R* cannot be accepted by a DFA with less than three states. 

Again we use the same DFA M defined in the proof of Theorem [2] for any integer n > 2. As stated 
before, it has been proved that the minimal DFA accepting L(M)* has 2 ra_1 + 2 n_2 states. So any DFA 
accepting L\L(M)* needs at least 2 n_1 + 2 n " 2 states. □ 

For this combined operation, we choose 2 n+i to be an approximation of its state complexity. Then 
the ratio bound can be calculated easily as follows: 

2 n+1 _ 8 
2n-\ + 2 n - 2 ~ 3 
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5 State complexity approximation of the catenation of regular languages 

As we know, the state complexity of the catenation of an n\ -state DFA language and an restate DFA 
language, n\ > 1 and n 2 > 2, is n{l ni — 2 n2_1 ( |fT5Tl ). The state complexity of multiple catenations has 
been studied in [3] and the following estimate was obtained. 

Claim 1. Let R\ , . . . , R^, k > 2, be regular languages accepted by DFAs ofni, ... ,rik states, respectively. 
Then the state complexity of R\ ■■ ■ Rk is no more than 

ni 2 n 2H Hn fe _2 n 2H Hrifc-1 _ 2 n3H ^ ra fc _1 2 nfc_1 

The exact state complexity of the catenations of three and four regular languages was also obtained 
in 0. In this section, we prove the exact state complexities of the catenation of k regular languages 
for arbitrary k > 2. Note that this is not a state complexity in the normal definition that is for only one 
specific (combined) operation. This is a state complexity (formula) for a class of (combined) operations. 

After we prove this state complexity, we show an approximation of the complexity and state why the 
approximation is useful in this case. 

We first consider a lower bound. 
Theorem 5. For any integers n\ > 2, 1 < i < k, there exist DFA Ai of n{ states, respectively, such that 
any DFA accepting L(A\) ■ ■ ■ L(A/,) needs at least 

fc-i 

ni2 ni+-+n k _ D _ 

i=\ 

states, where 

D = n l {2 ni+ - +n " - 1) +rai(2™ 2 - \)(2 n4+ - +rik - 1) + • • • + m(2 n2 - 1) • • • (2 nfc ~ 2 - l)(2 nk - 1); 
Ei = l + (2 n2 - 1 -l)(l + (2 n3 -l)(l + (2 n4 -l)---(l + (2 nfc - 1 -l)2 nfc )...)); 
E 2 = (m - l)2" 2 - 1 (l + (2™ 3 - 1 - 1)(1 + (2"' 4 - 1) • • • (1 + (2" fc -' - l)2" fc ) . . .)) 
+ 2 n2 ~ 2 (l + (2" 3 - 1 - 1)(1 + (2" 4 - 1) ... (1 + (2 nfc -' - l)2 nfc ) . . .)); 

Ei = (m - l)(2 n2 " 1 - 1) • ■ -2 n *-\\ + (2 n >+'- 1 - 1)(1 + (2 n « - 1) ••• (1 + (2 nfc -' - \)2 rik ) . . .)) 
+ . . . + 2™ 2 " 2 • • • 2 ni - 2 (l + (2 n »+'- 1 - 1)(1 + (2™'+ 2 - 1) ••• (1 + (2 nk ~ l - l)2 nfc ) . . .)). 
Proof: Let I = {aj \ 1 < j < 2k - 1}. Define a DFA A x = (Qi,£,<5i,0,Fi), where 

Qi = {0,l,...,m-1}, 
Fi = {ni - 1}, 
8\{t,a\) = t+\ mod n\, < t < n\ — 1, 
Si (t, a 2 k-2) =0, < t < n\ - 1, 

6\(t,b) =t, b € £— {ai,a%k-2\i < t < n\ — 1. 
Let DFA A i = (Q i ,'L,Si,0,Fi},2<i<k, where 

Qi = {0, l,...,m- 1}, 

F = {»*-!}, 

fl2i-2) = t + 1 mod < t < rii — 1, 

<5i(£,a2i-i) = L < t < m- 1, 

=t, 6 G I- {a 2 i_2,a2i-i}, 0<t <n»-l. 
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For each x G {a\ , 02, 04, ... , 0,2*1-2}* and 2 < s < k, we define 

P s (x) = {p I x = u\U2-..u s , ui G L(Ai),\ <l<s-\, andp = # a2s _ 2 (u s ) mod n s }. 

Consider that x,y G {ai ,02, 04, ... ,02^.-2}* such that P s (x) / P s {y)- Let c G P s (x) — P s (y) (or 
P s (y) — P s (x)) and u; = a^^^^s+io^' -1 ' '' a 2k-i<hk-2 - Then it is clear that xw G L{A{) • ■ ■ L(Ak) 
but yu; ^ i(-Ai) • • • L(Ak). So, x and y are in different equivalence classes of the right-invariant relation 
induced by L(A\) ■ ■ ■ L(A k ). 

For each x G {ai ,02,04, . . . ,02fc-2}*, define 

P\{x) = # ai (z) where x = ydz, y G {a u a 2 ,a 4 , . . . ,a 2 k-2}* , 

z G {01,02,04, ... ,a 2 fc-4}*, if a2fc-2 occurs in x; 
P\{x) = # ai (x), otherwise. 

Consider u,v G {01,02,04,..., a2fe-2}* such that P\ (u) mod ni > P\(v) modrii. 
Let i = P\(u) mod n\ and w = a™ 1-1-1 ^^ 2-1 ■ ■ ■ a2k-ia>2k-2- Then clearly uw € L{A\)- ■ ■ L{A k ) but 
vw£L{A l )---L(A k ). 

Notice that there does not exist a word w such that ^ Pi(w) and P\(w) = n\ — 1, since 
P\(w) = n\ — 1 guarantees that G P2(w). Because of the same reason, there does not exist a word w 
such that n t -le Pt(w) and P t +i(w), 2 < t < k — 1. It is also impossible that Pt(w/) = but 

Pt+i(u>)^0. 

For each subset p s = {di )S , . . . ,d esjS } of {0, . . . ,n s — 1} where di jS > • • • > d 6s)S and 2 < s < k, and 
an integer p\ G {0, . . . ,ni — 1}, except the cases we mentioned above, there exists a word 



ni n? «3 

Oj a 2 a 4 



ni n? m 
Oj fl2 4 • • 

ni di,2 — <h,2 

a, a 



a 2k-4 a 2k-2 a \ a 2 a 4 

n k — l t ^ e k^ l ' k ^ < ^ e k- k n l n 2 n 3 n k — 1 ^ e fc' fc 

2fc-4 a 2£i-2 a l fl 2 a 4 " ' a 2fe-4 a 2fc-2 

di,fc-i-<fc,fc-i „ n l„ n 2 ™3 ^^fc-i'*- 1 
a 2fc-4 " ' U l U 2 "4 " ' a 2fc-4 

ni ^ei. 2 Pi 

■•Oj a 2 a j . 



1 "2 

such that Pi (x) = p\ and P s (x) = p s 

In total, there are ni2 n2 2™ 3 • • -2 nk classes. There are 

D = ni(2 n3+ - +rife - 1) + ni(2™ 2 - l)(2 n4+ - +n * - 1) + • • • + rai(2™ 2 - 1) • • • (2™ fc - 2 - l)(2" fc - 1) 
classes with both p t = and p t+ \ ^0, 2<t<k — 1. There are 

El = (1 + (2" 2_1 - 1)(1 + (2™ 3 - 1)(1 + (2 n4 - 1) • • • (1 + (2™ fe -' - l)2 nfc ) . . .)) 
classes with both p\ = n\ — 1 and ^ p2- There are 

£ 2 = (ni-l)2 ri2 ' 1 (l + (2 ri3 - 1 -l)(l + (2 ri4 -l)---(l + (2 rifc -'- 1 )2 nfc )...)) 
+ 2 n2 ~ 2 (l + (2" 3 - 1 - 1)(1 + (2™ 4 - 1) • • • (1 + (2" fc -' - l)2" fc ) . . .)) 

classes with both 712 — 1 G P2 and ^ p$, which are not in E\. We omit the other similar classes until the 
2th group of classes. There are 

Ei = (ni - l)(2 n2 ~ 1 - 1) • • ■2 ni_1 (l + (2 n ^ 1 " 1 - 1)(1 + (2 n< + 2 - 1) • • • (1 + (2 nfc -> - l)2 nfc ) . . .)) 
+ . . . + 2" 2 - 2 • • .2"^ 2 (1 + (2" l +'- 1 - 1)(1 + (2 n ^ 2 - 1) • • • (1 + (2 nk - i - \)2 nk ). . .)) 

classes with both rij — 1 G pi and ^ Pi+i, which are not in E\,E2,. . . ,Ei-\. 

Thus, there are at least ni2 n2+ - +nfc — D — E\ — E2 Pfc-i distinct equivalence classes. □ 
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Theorem 6. Let A it 1 <i < k be k DFAs of n it respectively, where A,- L has final states, < fi < 
Then there exists a DFA of 



fc-1 

n 1 2 n2+ - +nk -D- 1 £E, l 
j=i 



states that accepts L(A\) ■ • ■ L{A k ), where 

D = n 1 (2" 3+ - + " fc - 1) + ni (2 n2 - i)(2™ 4 +-+"'= - 1) + . . . + m (2™ 2 - 1) • • • (2™ fc - 2 - l)(2 rlfc - 1); 
Ei = /i(l + (2" 2 " 1 - 1)(1 + (2™ 3 - 1)(1 + (2" 4 - 1) ■ ■ ■ (1 + (2" fc -' - l)2" fc ) . . .)); 
E 2 = (ni - /i)(2 /2 - 1)2™ 2 " /2 (1 + (2" 3 " 1 - 1)(1 + (2™ 4 - 1) ■ • ■ (1 + (2"*-' - 1)2™") . . .)) 
+/i(2 /2 - 1)2™ 2 " /2 " 1 (1 + (2™ 3 " 1 - 1)(1 + (2" 4 - 1) ■ • ■ (1 + (2 nk -' - l)2 nk ) . . .)); 

E l = (ni - /i)(2™ 2 ~ /2 -!)••• (2 fi - l)2" i_/i (l + (2 ni+1 ~ 1 - 1)(1 + (2™ l + 2 -!)••• (1 + (2 rlfc -' - l)2™ fc ) . ..)) + ■ 



-/l(2 /2 -l)2 



"2-/2-1 



■(2 J *-1)2 I; 



l (l + (2" 



Proo/: Let DFA A { = (Qj,E,^,0,i^), 1 < i < fc. 
Construct E = {Qe,^,Se,Qo,Fe) such that 



-l)(l + (2 r; 



fe-1 



!)■■• (l + (2 n 



1)2"*)...)). 



Q E = Q\ x2° 2 x2* x ••• x2 Qt -£>'- £ 



i=l 



<70 = < 



(0,0,. ..,0), ifO0Fi, 1 <z< fc; 

(O,{O},...,0), ifOGFi andO^Fi,2<i</c; 



(0,{0},...,{0}}, ifO€F if 1 < « < fe — 1; 
Fe = {(ui,u 2 ,...,u k ) £ Q E I u k (lF k / 0}; 
$E '■ 8e{{ui,U2, ■ ■ ■ ,u k },a) = (ui,v^, . . . ,u' k ), for a G E, where 
ui = ^(u^a), 

f^ 2 (n 2 ,a)U{0}, if^GFj, 
1^2(^2,0)5 otherwise, 

'(5 Ai (^,a)U{0}, 
(^(u^a), otherwise, 



u 2 



u, 



for 3 < i < k 



where 



D' = Qi x {0} x (2* x ■ ■ ■ x 2 Qk - {0} fc ~ 2 ) + Qi x (2 Q2 - {0}) x {0} x (2 Q4 x ■ ■ ■ x 2 Qk - {0} fc ~ 3 ) + ■ • ■ 

+ Qi x (2 Q2 - {0}) x ■ ■ ■ x (2 Qfc - 2 - {0}) x {0} x (2 Qk - {0}); 
E[ = F x x ({0} fe_1 U (2 Q2 " {0} - {0}) x ({0} fe ~ 2 U (2* - {0}) x • • • x ({0} 2 U (2 Qfc -> - {0})2 Qfc )...)); 
E'2 = (Qi - Fi) x ((2 F - {0}) U2 Q2 "- F2 ) x ({0} fc - 2 U (2 Q3 - {Q} - {0}) x • • • x ({0} 2 U {2 Qk -' - {0})2 Qfc ) . . .)) 

+ F, x ((2 F2 - {0}) u2 Q2 ~ F2 - {0} ) x ({0} fc ^ 2 U (2 Q3 ~ {0} - {0}) x • • • x ({0} 2 U {2 Qk -' - {0})2 Qfe ) . . .)); 

E'i = (Qi-F,)x(2 92 - F2 -{0})x...x((2 F -{0})u2 Q '- F ) 

x ({0} fe ~* U (2 Qs+1 ~ {0} - {0}) x ■ ■ ■ x ({0} 2 U (2 Qfc -' - {0})2 Qfc )...)) 

H +Fi x ((2 F - {0}) U 2 Q2 - F2 ~{°>) x • • • x ((2 F - {0}) U 2 Q ^ F ^ {a} ) x ({0} fe_i 

U (2^ + '- {0} - {0}) x ■ ■ ■ x ({0} 2 U {2 Qk - 1 - {0})2 Qfc ) . . .))• 
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Intuitively, Qe is a set of /c-tuples whose first component is a state in Q\ and the zth component is a 
subset of states in Qi, 2 < i < k. 

Qe does not contain those fc-tuples whose ith component is and whose jth component is not 0, 
when 1 < i < j < k. D' is the set of them. 

Q e does not contain those fc-tuples whose first component is an element of F\ and whose second 
component is not (if it is then all the elements afterward have to be 0) and does not contain 0, either. 
E\ is the set of them. 

Q e does not contain those fc-tuples whose ith component contains one or more final states of DFA 
A{ and whose (i + l)th component is not (if it is then all the elements afterward have to be 0) and 
does not contain 0, when 2 < i < k — 1 , either. E[ is the set of them. 

Clearly, L{E) = L(A X )- ■ ■ L{A k ). Let \Q At \ = m and \F Ai \ = fi,l<i<k. 
Then E has n 1 2" 2+ - +n '= -D-E x -E 2 E k _ x states. □ 

Note that when each A4, 1 < i < k, has one final state, this upper bound is exactly the same as the 
lower bound stated in Theorem [5] Thus, this bound is tight and is the state complexity of the catenation 
of k regular languages. 

Although we have proved that this state complexity is tight, it is too long and complex to be intuitive 
and comprehensible. Let SCcAr{n\, . . . ,n k ) denote the state complexity of catenation of k languages 
accepted by ni-state, . . ., n^-state DFAs, respectively, m, . . . > 2. By observing the structure of the 
result, we can see that n{l ni ^ Vnk is a good approximation with the ratio bound 

SC C AT(ni,...,n k ) 

However, all our experiments show that the ratio bound for this approximation is less than 3, but we have 
not been able to prove it. 

6 Conclusion 

The new concept of state complexity approximation is introduced. It further advances the idea of state 
complexity estimation by including the ratio bound. The ratio bound gives a precise and intuitive mea- 
surement on the "quality" of the estimation. 

We show that state complexity approximation can play useful roles in two different cases. In the first 
case, the exact state complexities have not been obtained. They may be very difficult to obtain. However, 
approximation results with low ratio bounds can be obtained rather easily and they are good enough for 
practical purposes in general. In the second case, the exact state complexities have been proved. The 
approximations of those results with low ratio bounds can simplify the formulae of the complexities and 
make them more intuitive and easier to apply. 

Clearly, the state complexity approximation is a useful and important concept. We expect many new 
results on state complexity approximation will come out in the near future. 
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